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Abstract 

We present a new piggybacking framework for designing distributed storage codes that are efficient in data- 
read and download required during node-repair. We illustrate the power of this framework by constructing classes 
of explicit codes that entail the smallest data-read and download for repair among all existing solutions for three 
important settings: (a) codes meeting the constraints of being Maximum-Distance-Separable (MDS), high-rate and 
having a small number of substripes, arising out of practical considerations for implementation in data centers, (b) 
binary MDS codes for all parameters where binary MDS codes exist, (c) MDS codes with the smallest repair- 
locality. In addition, we employ this framework to enable efficient repair of parity nodes in existing codes that were 
originally constructed to address the repair of only the systematic nodes. The basic idea behind our framework is 
to take multiple instances of existing codes and add carefully designed functions of the data of one instance to the 
other. Typical savings in data-read during repair is 25% to 50% depending on the choice of the code parameters. 



I. Introduction 

Distributed storage systems today are increasingly employing erasure codes for data storage, since erasure codes 
provide much better storage efficiency and reliability as compared to replication-based schemes |TJ-|[3j. Frequent 
failures of individual storage nodes in these systems mandate schemes for efficient repair of failed nodes. In 
particular, upon failure of a node, it is replaced by a new node, which must obtain the data that was previously 
stored in the failed node by reading and downloading data from the remaining nodes. Two primary metrics that 
determine the efficiency of repair are the amount of data read at the remaining nodes (termed data-read) and the 
amount of data downloaded from them (termed data-download or simply the download). 
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Fig. 1: An example illustrating efficient repair of systematic nodes using the piggybacking framework. Two instances of a 
(6,4) MDS code are piggybacked to obtain a new (6,4) MDS code that achieves 25% savings in data-read and download in 
the repair of any systematic node. A highlighted cell indicates a modified symbol. 
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Fig. 2: An example illustrating of the mechanism of repair of parity nodes under the piggybacking framework, using two 



instances of the code of Fig. lc A shaded cell indicates a modified symbol. 



In this paper, we present a new framework, which we call the piggybacking framework, for design of repair- 
efficient storage codes. In a nutshell, this framework considers multiple instances of an existing code, and the 
piggybacking operation adds (carefully designed) functions of the data of one instance to the other. We design 
these functions with the goal of reducing the data-read and download requirements during repair. Piggybacking 
preserves many of the properties of the underlying code such as the minimum distance and the field of operation. 

We need to introduce some notation and terminology at this point. Let n denote the number of (storage) nodes 
and assume that the nodes have equal storage capacities. The data to be stored across these nodes is termed the 
message. A Maximum-Distance-Separable (MDS) code is associated to another parameter k: an [n, k] MDS code 
guarantees that the message can be recovered from any k of the n nodes, and requires a storage capacity of \ of 
the size of the message at every node. It follows that an MDS code can tolerate the failure of any (n — k) of the 
nodes without suffering any permanent data-loss. A systematic code is one in which k of the nodes store parts of 
the message without any coding. These k nodes are termed the systematic nodes and the remaining (n — k) nodes 
are termed the parity nodes. We denote the number of parity nodes by r = (n — k). We shall assume without loss 
of generality that in a systematic code, the first k nodes are systematic. The number of substripesof a (vector) code 
is defined as the length of the vector of symbols that a node stores in a single instance of the code. 

The piggybacking framework offers a rich design space for constructing codes for various different settings. We 
illustrate the power of this framework by providing the following four classes of explicit code constructions in this 
paper. 

(Class 1) A class of codes meeting the constraints of being MDS, high-rate, and having a small number of 
substripes, with the smallest known average data-read for repair: A major component of the cost of current day 
data-centers which store enormous amounts of data is the storage hardware. This makes it critical for any storage 
code to minimize the storage space utilization. In light of this, it is important for the erasure code employed to be 
MDS and have a high-rate (i.e., a small storage overhead). In addition, practical implementations also mandate a 
small number of substripes. There has recently been considerable work on the design of distributed storage codes 
with efficient data-read during repair [4)-|26|. However, to the best of our knowledge, the only explicit codes that 
meet the aforementioned requirements are the Rotated- RS (8j codes and the (repair-optimized) EVENODD (25j, 
1 27 1 and RDP |26|, [28 1 codes. Moreover, Rotated-RS codes exist only for r G {2, 3} and k < 36; the (repair- 
optimized) EVENODD and RDP codes exist only for r = 2. Through our piggybacking framework, we construct a 
class of codes that are MDS, high-rate, have a small number of substripes, and require the least amount of data-read 
and download for repair among all other known codes in this class. An appealing feature of our codes is that they 
support all values of the system parameters n and k. 

(Class 2) Binary MDS codes with the lowest known average data-read for repair, for all parameters where 
binary MDS codes exist: Binary MDS codes are extensively used in disk arrays 1 27 1, [28 ]. Through our piggybacking 
framework, we construct binary MDS codes that require the lowest known average data-read for repair among all 
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existing binary MDS codes (8), (25j-(29j. Furthermore, unlike the other codes and repair algorithms |8j, p5|-p9| 
in this class, the codes constructed here also optimize the repair of parity nodes (along with that of systematic 
nodes). Our codes support all the parameters for which binary MDS codes are known to exist. 

(Class 3) Efficient repair MDS codes with smallest possible repair-locality: Repair-locality is the number of 
nodes that need to be read during repair of a node. While several recent works p0j-|[23| present codes optimizing 
on locality, these codes are not MDS and hence require additional storage overhead for the same reliability levels 
as MDS codes. In this paper, we present MDS codes with efficient repair properties that have the smallest possible 
repair-locality for an MDS code. 

(Class 4) A method of reducing data-read and download for repair of parity nodes in existing codes that 
address only the repair of systematic nodes: The problem of efficient node -repair in distributed storage systems 



has attracted considerable attention in the recent past. However, many of the codes proposed |7J-|9J, |T9J, pOj 
have algorithms for efficient repair of only the systematic nodes, and require the download of the entire message 
for repair of any parity node. In this paper, we employ our piggybacking framework to enable efficient repair of 
parity nodes in these codes, while also retaining the efficiency of repair of systematic nodes. The corresponding 
piggybacked codes enable an average saving of 25% to 50% in the amount of download and read required for 
repair of parity nodes. 

The following examples highlight the key ideas behind the piggybacking framework. 

Example 1: This example illustrates one method of piggybacking for reducing data-read during systematic node 



repair. Consider two instances of a (6,4) MDS code as shown in Fig. la with the 8 message symbols {ai}f =1 



and {bi}f =1 (each column of Fig. la depicts a single instance of the code). One can verify that the message can 
be recovered from the data of any 4 nodes. The first step of piggybacking involves adding Ylf=i to ^ second 
symbol of node 6 as shown in Fig. [77?] The second step in this construction involves subtracting the second symbol 
of node 6 in the code of Fig. ^Tb^from its first symbol. The resulting code is shown in Fig. 1c This code has 2 
substripes (the number of columns in Fig. lc). 

We now present the repair algorithm for the piggybacked code of Fig. [7c] Consider the repair of node 1. Under 
our repair algorithm, the symbols &2, i>3 5 ^4 cind 2Za=i &i are download from the other nodes, and b\ is decoded. 
In addition, the second symbol (X^i=i + 2~2i=i ^ a «) °f node 6 is downloaded. Subtracting out the components of 
{bi}j =1 gives the piggyback Y^a=i ^ a i- Finally, the symbol a<i is downloaded from node 2 and subtracted to obtain 
a\. Thus, node 1 is repaired by reading only 6 symbols which is 75% of the total size of the message. Node 2 can 
be repaired in a similar manner. Repair of nodes 3 and 4 follows on similar lines except that the first symbol of 
node 6 is read instead of the second. 

The piggybacked code is MDS, and the entire message can be recovered from any 4 nodes as follows. If node 6 
is one of these four nodes, then add its second symbol to its first, to recover the code of Fig. lb Now, the decoding 
algorithm of the original code of Fig, la is employed to first recover {ai}f =1 , which then allows for removal of 
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the piggyback (J2 i=1 idi) from the second substripe, making the remainder identical to the code of Fig. 

Example 2: This example illustrates the use of piggybacking to reduce data-read during the repair of parity 
nodes. The code depicted in Fig. [2] takes two instances of the code of Fig. [7c] and adds the second symbol of node 
6> Ci=i ibi + 2~2i=i * a «) (which belongs to the first instance), to the third symbol of node 5 (which belongs to the 
second instance). This code has 4 substripes (the number of columns in Fig. [2]). In this code, repair of the second 
parity node involves downloading {ai,Ci, di}f =1 and the modified symbol (2~2i=i c « + Ylt=i + 2~2i=i * a «)> using 
which the data of node 6 can be recovered. The repair of the second parity node thus requires read and download 
of only 13 symbols instead of the entire message of size 16. The first parity is repaired by downloading all 16 
message symbols. Observe that in the code of Fig. [7c] the first symbol of node 5 is not used for repair of any of 
the systematic nodes. Thus the modification in Fig. [2] does not change the algorithm or the efficiency of the repair 
of systematic nodes. The code retains its MDS property: the entire message can be recovered from any 4 nodes by 
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first decoding {aj, bi}f =i using the decoding algorithm of the code of Fig. la which then allows for removal of the 



piggyback (X^i=i + Sj=i ^ a i) f rom the second instance, making the remainder identical to the code of Fig. 



la 



Our piggybacking framework, enhances existing codes by adding piggybacks from one instance onto the other. 
The design of these piggybacks determine the properties of the resulting code. In this paper, we provide a few 
designs of piggybacking and specialize it to existing codes to obtain the four specific classes mentioned above. 
This framework, while being powerful, is also simple, and easily amenable for code constructions in other settings 
and scenarios. 

The rest of the paper is organized as follows. Section [n] introduces the general piggybacking framework. 
Sections III and IV then present code designs and repair-algorithms based on this framework, special cases of 
which result in classes 1 and 2 discussed above. Section [V] provides piggyback design which result in low-repair 
locality along with low data-read and download. Section [VI] provides a comparison of these codes and various other 
codes in the literature. Section VII demonstrates the use of piggybacking to enable efficient parity repair in existing 
codes that were originally constructed for repair of only the systematic nodes. Section VIII draws conclusions. 



Readers interested only in repair locality may skip Sections HTl] and IV and readers interested only in the 



mechanism of imbibing efficient parity-repair in existing codes optimized for systematic-repair may skip Sections III 
IV and M without any loss in continuity. 



II. The Piggybacking Framework 

The piggybacking framework operates on an existing code, which we term the base code. The choice of the base 
code is arbitrary. The base code is associated to n encoding functions {A}r=i : ^ takes the message u as input and 
encodes it to n coded symbols {/i(u), . . . , / n (u)}. Node i (1 < i < n) stores the data A(u). 

The piggybacking framework operates on multiple instances of the base code, and embeds information about one 
instance into other instances in a specific fashion. Consider a instances of the base code. The encoded symbols in 
a instances of the base code are 



Node 1 


A (a) 


A(b) 




AW 












Node n 


/n(a) 


fn(b) 




fn(z) 



where a, . . . , z are the (independent) messages encoded under these a instances. 

We shall now describe the piggybacking of this code. For every i, 2 < i < a, one can add an arbitrary function 
of the message symbols of all previous instances {1, . . . , (i — 1)} to the data stored under instance i. These functions 
are termed piggyback functions, and the values so added are termed piggybacks. Denoting the piggyback functions 
by gtj (i 6 {2, . . . , a}, j G {1, . . . , n}), the piggybacked code is thus: 
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A(a) 




A(c) + 93,1 ( a > b ) 




/i(z) +5a,i(a,...,y) 














Node n 


/n(a) 


AO) +52,n(a) 


A(c) + 3 ,n(a,b) 




/n(z) +5a,n(a,...,y) 



The decoding properties (such as the minimum-distance or the MDS nature) of the base code are retained upon 
piggybacking. In particular, the piggybacked code allows for decoding of the entire message from any set of nodes 
from which the base code allowed decoding. To see this, consider any set of nodes from which the message can 
be recovered in the base code. Observe that the first column of the piggybacked code is identical to a single 
instance of the base code. Thus a can be recovered directly using the decoding procedure of the base code. The 
piggyback functions {fl'2,i(a)}™ =1 can now be subtracted from the second column. The remainder of this column 
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is precisely another instance of the base code, allowing recovery of b. Continuing in the same fashion, for any 
instance i (2 < i < n), the piggybacks (which are always a function of previously decoded instances {1, . . . , i — 1}) 
can be subtracted out to obtain the base code of that instance which can be decoded. 

The decoding properties of the code are thus not hampered by the choice of the piggyback functions <7j,/s. This 
allows for flexibility in the choice of the piggyback functions, and these need to be picked cleverly to achieve the 
desired goals (such as efficient repair, which is the focus of this paper). 

The piggybacking procedure described above was followed in Example [T] to obtain the code of Fig. lb from 



Fig. la Subsequently, in Example [2j this procedure was followed again to obtain the code of Fig. [2] from Fig. lc 



The piggybacking framework also allows any invertible linear transformation of the data stored in any individual 



node. In other words, each node of the piggybacked code (e.g., each row in Fig. lbi can separately undergo a 
invertible transformation. Clearly, any invertible transformation of data within the nodes does not alter the decoding 
capabilities of the code, i.e., the message can still be recovered from any set of nodes from which it could be 
recovered in the base code. In Example[TJ the code of Fig. lc is obtained from Fig. lb via an invertible transformation 
of the data of node 6. 

The following theorem formally proves that piggybacking does not reduce the amount of information stored in 
any subset of nodes. 

Theorem 1: Let U\, . . . , U a be random variables corresponding to the messages associated to the a instances 
of the base code. For i G {1, ... , n}, let X{ denote the data stored in node i under the base code. Let Yj denote 
the encoded symbols stored in node i under the piggybacked version of that code. Then for any subset of nodes 
SC{l,...,n}, 

1 {{Yihes ;Ui,...,U a )>I {{Xi} ieS ; U u . . . , U a ) . (1) 

The proof of this theorem is provided in the appendix. 

Corollary 2: Piggybacking a code does not decrease its minimum distance; piggybacking an MDS code preserves 
the MDS property. 

Notational Conventions: For simplicity of exposition, we shall assume throughout this section that the base 
codes are linear, scalar, MDS and systematic. Using vector codes (such as EVENODD or RDP) as base codes is a 
straightforward extension. The base code operates on a /c-length message vector, with each symbol of this vector 
drawn from some finite field. The number of instances of the base code during piggybacking is denoted by a, and 
{a, b, . . .} shall denote the fc-length message vectors corresponding to the a instances. Since the code is systematic, 
the first k nodes store the elements of the message vector. We use pi, ... ,p r to denote the r encoding vectors 
corresponding to the r parity symbols, i.e., if a denotes the A>lengfh message vector then the r parity nodes under 
the base code store a, . . . , p^a. 

The transpose of a vector or a matrix will be indicated by a superscript T . Vectors are assumed to be column 
vectors. For any vector v of length k, we denote its k elements as v = [v\ • • • v K ] T , and if the vector itself has 
an associated subscript then we its elements as Vj = [v^i ■ ■ ■ t>i )K ] T - 

Each of the explicit codes constructed in this paper possess the property that the repair of any node entails 
reading of only as much data as what has to be downloaded. [] This property is called repair-by-transfer (24j . Thus 
the amounts of data-read and download are equal under our codes, and hence we shall use the same notation 7 to 
denote both these quantities. 



'in general, the amount of download lower bounds the amount of read, and the download could be strictly smaller if a node passes a 
(non-injective) function of the data that it stores. 
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III. Piggybacking Design 1 

In this section, we present our first design of piggyback functions and associated repair algorithms. This design 
allows one to reduce data-read and download during repair while having a small number of substripes. For instance, 
when the number of substripes is is small as 2, we can achieve a 25 to 35% savings during repair of systematic 
nodes. We shall first present the piggyback design for optimizing the repair of systematic nodes, and then move 
on to the repair of parity nodes. 



A. Efficient repair of systematic nodes 



This design operates on a = 2 instances of the base code. We first partition the k systematic nodes into r 
sets, Si,...,S r . of equal size (or nearly equal size if k is not a multiple of r). For ease of understanding, let 
us assume that A; is a multiple of r, which fixes the size of each of these sets as -. Then, let 5*1 = {!,...,-}, 



S 2 = {£ + 1, . . . , f } and so on, with S t = {™ + 1 
Define the following /c— length vectors: 

q 2 = [ p r ,l • • • Vr,\ • • • 

q3 = [ • • • ■ ■ ■ Pr,^ • • 



} for i = 1, . . . ,r. 








r 

IT 



qr 

qr+i 

Also, let 



••• p r> A (r _ 2)+1 ••• P P ,A( r _i) ••• ] T 

•••0 P Pj 4( r _i)+i ••• Pr,k f 



= p r - q r 

= i Pr,l ••• ••• ••• Pr,£(r-2) ■•' Pr±(r-l)+l ' ' ' Pr,k ? 

Note that each element pij is non-zero since the base code is MDS. We shall use this property during repair 
operations. 

The base code is piggybacked in the following manner: 
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P2 a 


P2 h + qfa 






T 





Fig. lb depicts an example of such a piggybacking. 

We shall now perform an invertible transformation of the data stored in node (k + r). In particular, the first 
symbol of node (k + r) in the code above is replaced with the difference of this symbol from its second symbol, 
i.e., node (k + r) now stores 
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Node k+r 



T 
vi a 



The other symbols in the code remain intact. This completes the description of the encoding process. 

Next, we present the algorithm for repair of any systematic node £ (e {1, . . . , k}). This entails recovery of the 
two symbols ag and bi from the remaining nodes. 

Case I (£ ^ S r ): Without loss of generality let £ £ S\. The k symbols {61, . . . , bi+i, p^fb} are 
downloaded from the remaining nodes, and the entire vector b is decoded (using the MDS property of the base 
code). It now remains to recover ag. Observe that the £ th element of q2 is non-zero. The symbol (p^b + q^a) is 
downloaded from node (k + 2), and since b is completely known, p5Tb is subtracted from the downloaded symbol 
to obtain the piggyback q^a. The symbols {a^i^s^e} w& a l so downloaded from the other systematic nodes in 
set S\. The specific (sparse) structure of q2 allows for recovering ag from these downloaded symbols. Thus the 
total data-read and download during the repair of node I is (fe+ -) (in comparison, the size of the message is 2k). 

Case 2 (S = S r ): As in the previous case, b is completely decoded by downloading {61, . . . , £^_i, bg + \, • • • , pf b}. 
The first symbol (v^Ta — p^b) of node (k + r) is downloaded. The second symbols {pf b + qf a}j stored in the 
parities % G {(k + 2), . . . , + r — 1)} are also downloaded, and are then subtracted from the first symbol of node 
(k + r). This gives (q^ +1 a + w T b) for some vector w. Using the previously decoded value of b, v T b is removed 
to obtain q^ +1 a. Observe that the £ th element of q r +i is non-zero. The desired symbol ag can thus be recovered by 
downloading {a* ( r _ 1 ) +1 , . . . , at-%, fl^+i, • • • , afc} from the other systematic nodes in S r . The total data-read and 
download required in recovering node £ is (k + ^ + r — 2). 

Observe that the repair of systematic nodes in the last set S r requires more read and download as compared to 
repair of systematic nodes in the other sets. Given this observation, we do not choose the sizes of the sets to be 
equal (as described previously), and instead optimize the sizes to minimize the average read and download required. 



For % = 1, . . . , r, denoting the size of the set Si by U 

t\ = • • • = t r -\ 



the optimal sizes of the sets turn out to be 
2" 



k r 
r 2r 
k-(r - l)t 



(2) 
(3) 

The amount of data read and downloaded for repair of any systematic node in the first (r — 1) sets is (k + 1), and 
the last set is (k + t r + r — 2). Thus, the average data-read and download 7^ for repair of systematic nodes, as a 
fraction of the total number 2k of message symbols, is 

P - t r ) (k + t) + t r (k + t r + T ~ 2)\ . (4) 



sys 

7i 



2A; 2 



This quantity is plotted in Fig. 5a for various values of the system parameters n and k. 



B. Reducing data-read during repair of parity nodes 



We shall now piggyback the code constructed in Section III-A to introduce efficiency in the repair of parity nodes, 



while also retaining the efficiency in the repair of systematic nodes. Observe that in the code of Section III-A 



the first symbol of node (k + 1) is never read for repair of any systematic node. We shall add piggybacks to this 
unused parity symbol to aid in the repair of other parity nodes. 



This design employs m instances of the piggybacked code of Section III-A The number of substripes in the 
resultant code is thus 2m. The choice of m can be arbitrary, and higher values of m result in greater repair-efficiency. 
For every instance i € {2, 4, ... , 2m — 2}, the (r — 1) parity symbols in nodes (k + 2) to (k + r) are summed up. 
The result is added as a piggyback to the (i + l) th symbol of node (k + 1). The resulting code, when m = 2, is 
shown below. 
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Node k 
Node k+1 
Node k+2 

Node k+r-1 
Node k+r 





7 


Cl 


di 












T 

Ok 




7 

dk 


T 

Pi a 


Pib 
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p^_ib + q^a 


T 

Pr 1 C 


Pr-i d + q^-i c 


vJTa — p^b 


pJTb + q^Ta 


v^Tc — p^d 


p^d + q^c 



This completes the encoding procedure. The code of Fig. [2] is an example of this design. 

As shown in Section InJ the piggybacked code retains the MDS property of the base code. In addition, the 



repair of systematic nodes is identical to the that in the code of Section III-A since the symbol modified in this 
piggybacking was never read for the repair of any systematic node in the code of Section III-A 

We now present an algorithm for efficient repair of parity nodes under this piggyback design. The first parity node 
is repaired by downloading all Irak message symbols from the systematic nodes. Consider repair of some other 
parity node, say node I 6 {k + 2, . . . , k + r}. All message symbols a, c, ... in the odd substripes are downloaded 
from the systematic nodes. All message symbols of the last substripe (e.g., message d in the m = 2 code shown 
above) are also downloaded from the systematic nodes. Further, the {3 rd , 5 th , . . . , (2m — l) th } symbols of node 
(k + 1) (i.e., the symbols that we modified in the piggybacking operation above) are also downloaded, and the 
components corresponding to the already downloaded message symbols are subtracted out. By construction, what 
remains in the symbol from substripe i (G {3, 5, . . . , 2m — 1}) is the piggyback. This piggyback is a sum of the 
parity symbols of the substripe (i — 1) from the last (r — 1) nodes (including the failed node). The remaining 
(r — 2) parity symbols belonging to each of the substripes {2, 4, ... , 2m — 2} are downloaded and subtracted out, 
to recover the data of the failed node. The procedure described above is illustrated via the repair of node 6 in 
Example [2j 

The average data-read and download 7^ ai for repair of parity nodes, as a fraction of the total message symbols, 

is 



par 

Ti 



1 

2kr 



2k + (r - 1) 1 + — )k + 1 



in 



1 



m 



(r-1) 



This quantity is plotted in Fig. 5b for various values of the system parameters n and k. 



IV. Piggybacking Design 2 

The design presented in this section provides a higher efficiency of repair as compared to the previous design. 
On the downside, it requires a larger number of substripes: the minimum number of substripes required under the 
design of Section III-A is 2 and under that of Section III-B is 4, while that required in the design of this section 
is (2r — 3). The following example illustrates this piggybacking design. 

Example 3: Consider some (n = 13, k = 10) MDS code as the base code, and consider a = (2r — 3) = 3 
instances of this code. Divide the systematic nodes into two sets of sizes 5 each as S\ = {1, ... ,5} and S2 = 
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{6, . . . , 10}. Define 10-length vectors q2, V2, q3 a«ci V3 as 



q2 


= [P2,l ■ 


•• P2,5 





• o] 


v 2 


= [o 


•• 


P2,6 • 


• P2,w] 


q3 


= [o 


•• 


P3,6 • 


■ P3,10] 


V3 


= [P3,l • 


•• P3,5 





•• 0] 



Now piggyback the base code in the following manner 
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Pi a 


pfb 


T 
PlC 


Node 12 


P2 a 


Pifb + q^a 


Pl'c + q^b + q^a 


Node 13 


T 
P3 a 


P3 b + qj a 


P3C + q^b + qja 



Next, we take invertible transformations of the (respective) data of nodes 12 and 13. The second symbol of node 
i G {12, 13} in the new code is the difference between the second and the third symbols of node i in the code 
above. The fact that (p2 — q2) = ^2 and (p3 — q^) = V3 results in the following code 



Node 1 


a\ 


bi 


Cl 










Node 10 




ho 


ClO 


Node 11 


T 

Pi a 


pfb 


T 
PlC 


Node 12 


T 
P2 a 


vjb - p^c 


P2 c + q^b + qfa 


Node 13 


T 
P3 a 


vjb - p^c 


pIc + q^b + qfa 



This completes the encoding procedure. 

We now present an algorithm for (efficient) repair of any systematic node, say node 1. The 10 symbols {c2, . . . , C10, pfc} 
are downloaded, and c w decoded. It now remains to recover a\ and b\. The third symbol (p^c + q^b + q^a) 
of node 12 is downloaded and p 2 c subtracted out to obtain (q^b + qja). The second symbol (vqTb — p^c) from 
node 13 w downloaded and (—pjc) is subtracted out from it to obtain v^b. The specific (sparse) structure of 
and V3 allows for decoding a\ and b\ from (q^b + q<fa) and vjb, by downloading and subtracting out {aj}^ =2 
and {6j}f =2 - Thus, the repair of node 1 involved reading and downloading 20 symbols (in comparison, the size of 
the message is ka = 30). The repair of any other systematic node follows a similar algorithm, and results in the 
same amount of data-read. 

The general design is as follows. Consider (2r — 3) instances of the base code, and let ai,.. . a2 r _3 be the 
messages associated to the respective instances. First divide the k systematic nodes into (r — 1) equal sets (or 
nearly equal sets if k is not a multiple of (r — 1)). Assume for simplicity of exposition that k is a multiple of 
(r — 1). The first of the ^-j- sets consist of the first nodes, the next set consists of the next ^j-j- nodes and so 
on. Define A;-length vectors {vj, Vj}[ =2 as 

Vj = a r _i + ia r _ 2 + « 2 a r -3 H 1- « r ~ 2 ai 

Vj = Vj - a r _i = ia r _ 2 + « 2 a r -3 H 1- z r_2 ai . 
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Further, define /c-length vectors {qjj}^ j=i as 



p* 



o. 

where the positions of the ones on the diagonal of the (k x k) diagonal matrix depicted correspond to the nodes 
in the j th group. It follows that 

r-1 

^2<li,j =Pi Vi£ {2, . . . r} . 
Parity node (A; + i), i € {2, . . . , r}, is then piggybacked to store 



T 

p- ai 




pfa r _ 2 


Pfar-l+EiJ^-iqJjVi 


pf a »-+q^i v i 




pfa r+ i_ 3 +q^_ 2 Vi 


pf a r+q^vi 




pfa 2r - 3 +q^ r _iV l 



Following this, an invertible linear combination is performed at each of the nodes {k + 2, . . . ,k + r}. The transform 
subtracts the last (r— 2) substripes from the (r— l) th substripe, following which the node (k+i), i 6 {2, . . . , r}, stores 



T 

Pi ai 




pfa r _ 2 


qi,i-l a r-l-Ei=r Pi ' a j 


Pfar + q^vi 




pfa r+i _ 3 + q^_ 2 Vi 


Pfa r +q^vi 




pfa 2r _ 3 + q^ r _iVi 



Let us now see how repair of a systematic node is performed. Consider repair of node I. First, from nodes 
{1, . . . , k + 1}\{£}, all the data in the last (r — 2) substripes is downloaded and the data a r , . . . , a2 r _3 is recovered. 
This also provides us with the desired data {a r> g, . . . , a,2 r -3,e}- Next, observe that in each parity node {k+2, . . . , k+ 
r}, there is precisely one 'q' vector that has a non-zero first component. From each of these nodes, the symbol having 
this vector is downloaded, and the components along {a r , . . . , a2 r -3} are subtracted out. Further, we download 
all symbols from all other systematic nodes in the same set as node t, and subtract this out from the previously 
downloaded symbols. This leaves us with (r — 1) independent linear combinations of {ai/, . . . , a r _i^} from which 
the desired data is decoded. 



When k is not a multiple of (r — 1), the k systematic nodes are divided into (r — 1) sets as follows. Let 



r — 1 



t h 



k 



1 



t = (k - (r - l)t e ) . 



(5) 



The first t sets are chosen of size th each and the remaining (r — 1 — t) sets have size te each. The systematic 
symbols in the first (r — 1) substripes are piggybacked onto the parity symbols (except the first parity) of the last 
(r — 1) stripes. For repair of any failed systematic node £ G {1, . . . , k}, the last (r — 2) substripes are decoded 
completely by reading the remaining systematic and the first parity symbols from each. To obtain the remaining 
(r — 1) symbols of the failed node, the (r — 1) parity symbols that have piggyback vectors (i.e., q's and v's) with a 
non-zero value of the £ th element are downloaded. By design, these piggyback vectors have non-zero components 
only along the systematic nodes in the same set as node I. Downloading and subtracting these other systematic 
symbols gives the desired data. 

The average data-read and download 72 YS for repair of systematic nodes, as a fraction of the total message symbols 
2k, is 

1 f [t((r - 2)k + (r - l)t h ) + (k- t)((r - 2)k + (r - 1)^)] . (6) 



sys 
72 



2k 2 ' 



This quantity is plotted in Fig. Bal for various values of the system parameters n and k. 
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While we only discussed the repair of systematic nodes for this code, the repair of parity nodes can be made 

is 



efficient by considering m instances of this code. A procedure analogous to that described in Section III-B 



followed, where the odd instances are piggybacked on to the succeeding even instances. As in Section III-B higher 
a value of m results in a lesser amount of data-read and download for repair. In such a design, the average data-read 
and download 7^ for repair of parity nodes, as a fraction of the total message symbols, is 



par 

72 



1 r - 
r 2r 



1 



(m + l)(r - 2)k + (m - l)(r - 2)(r - 1) + 



rn ' 
~2 



k + 



m 
~2 



(r-1) 



(V) 



This quantity is also plotted in Fig. Bb] for various values of the system parameters n and k. 



V. Piggybacking Design 3 

In this section, we present a piggybacking design to construct MDS codes with a primary focus on the locality 
of repair. The locality of a repair operation is defined as the number of nodes that are contacted during the repair 
operation. The codes presented here perform the efficient repair of any systematic node with the smallest possible 
locality for any MDS code, which is equal to (k + 1). |^The amount of read and download is the smallest among 
all known MDS codes with this locality, when (n — k) > 2. 

This design involves two levels of piggybacking, and these are illustrated in the following two example construc- 
tions. The first example considers a = 2m instances of the base code and shows the first level of piggybacking, for 
any arbitrary choice of m > 1. Higher values of m result in repair with a smaller read and download. The second 
example uses two instances of this code and adds the second level of piggybacking. We note that this design deals 
with the repair of only the systematic nodes. 

Example 4: Consider any (n = 11, k = 8) MDS code as the base code, and take 4 instances of this code. Divide 
the systematic nodes into two sets as follows, S\ = {1,2,3,4}, S2 = {5,6,7,8}. We then add the piggybacks as 
shown in Fig. |3] Observe that in this design, the piggybacks added to an even substripe is a function of symbols 
in its immediately previous (odd) substripe from only the systematic nodes in the first set S\, while the piggybacks 
added to an odd substripe are functions of symbols in its immediately previous (even) substripe from only the 
systematic nodes in the second set S2. 

2 A locality of k is also possible, but this necessarily mandates the download of the entire data, and hence we do not consider this option. 
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p^b+ai + a 2 


T 
P2 C 


+b 5 + b e 


P2 d+ci + c 2 


Node 11 


T 
P3 a 


p^b+a 3 + a 4 


T 
P3 C 
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Fig. 3: Example illustrating first level of piggybacking in design 3. The piggybacks in the even substripes (in blue) are a 
function of only the systematic nodes {1, . . . , 4} (also in blue), and the piggybacks in odd substripes (in green) are a function 
of only the systematic nodes {5, . . . , 8} (also in green). This code requires an average data-read and download of only 71% 
of the message size for repair of systematic nodes. 
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We now present the algorithm for repair of any systematic node. First consider the repair of any systematic 
node i € {1, ...,4} in the first set. For instance, say 1 = 1, then {p2, . . . ,b$, p^fb} and {d2, ■ ■ ■ ,d$, Pi^d} 
are downloaded, and {b, d} (i.e., the messages in the even substripes) are decoded. It now remains to recover 
the symbols a\ and c\ (belonging to the odd substripes). The second symbol (p^b + a% + CI2) from node 10 is 
downloaded and p^b subtracted out to obtain the piggyback (a\ + 02). Now a\ can be recovered by downloading 
and subtracting out a-i- The fourth symbol from node 10, (p^d + ci +02), is also downloaded and p^d subtracted 
out to obtain the piggyback (c\ + C2). Finally, c\ is recovered by downloading and subtracting out C2- Thus, node 
1 is repaired by by reading a total of 20 symbols ( in comparison, the total total message size is 32 ). The repair 
of node 2 can be carried out in an identical manner. The two other nodes in the first set, nodes 3 and 4, can be 
repaired in a similar manner by reading the second and fourth symbols of node 1 1 which have their piggybacks. 
Thus, repair of any node in the first group requires reading and downloading a total of 20 symbols. 

Now we consider the repair of any node i £ {5, . . . , 8} in the second set S2. For instance, consider i = 5. 
The symbols {ai, . . . , a%, pj a} \{as}, {c\, . . . , eg, pf c} \{c^} and {di, . . . , d%, p± d} are downloaded in 

order to decode 05, cs, and cfe, From node 10, the symbol (p^c + 65 + 66) is downloaded and p^c is subtracted 
out. Then, 65 is recovered by downloading and subtracting out b&. Thus, node 5 is recovered by reading a total of 
26 symbols. Recovery of other nodes in S2 follows on similar lines. 

The average amount of data read and downloaded during the recovery of systematic nodes is 23, which is 71% of 
the message size. A higher value of m (i.e., a higher number of substripes) would lead to a further reduction in the 
read and download (the last substripe cannot be piggybacked and hence mandates a greater read and download; 
this is a boundary case, and its contribution to the overall read reduces with an increase in m). 

Example 5: In this example, we illustrate the second level of piggybacking which further reduces the amount 
of data-read during repair of systematic nodes as compared to Example [?] Consider a = 8 instances of an 
(n = 13, k = 10) MDS code. Partition the systematic nodes into three sets S\ = {1, . . . , 4}, S2 = {5, . . . , 8}, 
63 = {9, 10} (for readers having access to Fig. |4] in color, these nodes are coloured blue, green, and red 
respectively). We first add piggybacks of the data of the first 8 nodes onto the parity nodes 12 and 13 exactly as 
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Fig. 4: An example illustrating piggyback design 3, with k = 10, n = 13, a = 8. The piggybacks in the first parity node 
(in red) are functions of the data of nodes {8, 9} alone. In the remaining parity nodes, the piggybacks in the even substripes 
(in blue) are functions of the data of nodes {1, ... ,4} (also in blue), and the piggybacks in the odd substripes (in green) are 
functions of the data of nodes {5, . . . , 8} (also in green), and the piggybacks in red (also in red). The piggybacks in nodes 12 
and 13 are identical to that in Example [4] (Fig [3]). The piggybacks in node 11 piggyback the first set of 4 substripes (white 
background) onto the second set of number of substripes (gray background) 
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done in Example |4] ( see Fig. |?]). We now add piggybacks for the symbols stored in systematic nodes in the third 
set, i.e., nodes 9 and 10. To this end, we parititon the 8 substripes into two groups of size four each (indicated 
by white and gray shades respectively in Fig. [?]). The symbols of nodes 9 and 10 in the first four substripes are 
piggybacked onto the last four substripes of the first parity node, as shown in Fig. ^\(in red color). 

We now present the algorithm for repair of systematic nodes under this piggyback code. The repair algo- 
rithm for the systematic nodes { 1 , . . . , 8} in the first two sets closely follows the repair algorithm illustrated 
in Example [4] Suppose £ G Si, say £ = 1. By construction, the piggybacks corresponding to the nodes in 
Si are present in the parities of even substripes. From the even substripes, the remaining systematic symbols, 
{bi, di, fi, foj}i={2,...,io}> an d me symbols in the first parity, {pf b, p^d, p^f + bg + bio, Pi h + dg + dio}, are 
downloaded. Observe that, the first two parity symbols downloaded do not have any piggybacks. Thus, using the 
MDS property of the base code, b and d can be decoded. This also allows us to recover pjf, pfh from the 
symbols already downloaded. Again, using the MDS property of the base code, one recovers f and h. It now 
remains to recover {oi, ci, ei, gi}. To this end, we download the symbols in the even substripes of node 12, 
{p^b+ai + a2, p^d+ci + C2, p^f+ei + e%, p<fh+gi + §2}, which have piggybacks with the desired symbols. 
By subtracting out previously downloaded data, we obtain the piggybacks {ai + a2, ci +C2, &i + €2, 51+52}- Finally, 
by downloading and subtracting 02,02,62,52, we recover ai,ci,ei,gi. Thus, node 1 is recovered by reading 48 
symbols, which is 60% of the total message size. Observe that the repair of node 1 was accomplished by downloading 
data from only (k + 1) = 11 other nodes. Every node in the first set can be repaired in a similar manner. Repair of 
the systematic nodes in the second set is performed in a similar fashion by utilizing the corresponding piggybacks, 
however, the total number of symbols read is 64 ( since the last substripe cannot be piggybacked; such was the case 
in Example^as well). 

We now present the repair algorithm for systematic nodes {9, 10} in the third set S3. Let us suppose £ = 9. 
Observe that the piggybacks corresponding to node 9 fall in the second group (i.e., the last four) of substripes. 
From the last four substripes, the remaining systematic symbols {ei, fi, gi, fot}i={i,...,8,io}> an d the symbols in the 
second parity {pf e, p^f + ei + e<i, p^g + fi + /2, pf h + 51 + 52, } are downloaded. Using the MDS property of 
the base code, one recovers e, f, g and h. It now remains to recover ag, bg, eg and dg. To this end, we download 
{pf e + 09 + aio, pf f + bg + 610, Pi g + eg + cio, pf h + dg + dio} from node 11. Subtracting out the previously 
downloaded data, we obtain the piggybacks {ag + aio, ^9 + &10, eg + cio, dg + dio}- Finally, by downloading 
and subtracting out {aio, ^10, c io, ^io}> we recover the desired data {09,69,09,^9}. Thus, node 9 is recovered by 
reading and downloading 48 symbols. Observe that the repair process involved reading data from only (k+1) = 11 
other nodes. Node 10 is repaired in a similar manner. 

For general values of the parameters, n, k, and a = 4m for some integer m > 1, we choose the size of the three 
sets Si, S2, and 53, so as to make the number of systematic nodes involved in each piggyback equal or nearly 
equal. Denoting the sizes of Si, S2 and S3, by ti, t%, and £3 respectively, this gives 



ti 



1 



2r-l 



*2 



1 



*3 



1 



2r- 1 



(8) 



, 2r- 1 

Then the average data-read and download 7g ys for repair of systematic nodes, as a fraction of the total message 
symbols 4mk, is 
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(9) 



This quantity is plotted in Fig. 5a for various values of the system parameters n and k. 



VI. Comparison of different codes 



We now compare the average data-read and download entailed during repair under the piggyback constructions 
with various other storage codes in the literature. As discussed in Section [T| practical considerations in data centers 
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require the storage codes to be MDS, high-rate, and have a small number of substripes. The table below compares 
different explicit codes designed for efficient repair, with respect to whether they are MDS or not, the parameters 
they support and the number of substripes. Shaded cells indicate a violation of the aforementioned requirements. 
The parameter m associated to the piggyback codes can be chosen to have any value m > 1. The base code for 
each of the piggyback constructions is a Reed-Solomon code pi]. 



Code 



High-rate Regenerating [7|, |9| 
Product-Matrix MSR [5] 



Local Repair |20|-|23| 
Rotated RS |8| 
EVENODD, RDP [25|-[28| 
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Piggyback 2 
Piggyback 3 
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The piggyback, rotated-RS, (repair-optimized) EVENODD and RDP codes satisfy the desired conditions. Fig. [5] 
shows a plot comparing the repair properties of these codes. The plot corresponds to the number of substripes 
being 8 in Piggyback 1 and Rotated-RS, 4(2r — 3) in Piggyback 2, and 16 in the Piggyback 3 codes. We observe 
from the plot that piggyback codes require a lesser (average) data-read and download as compared to Rotated-RS, 
(repair-optimized) EVENODD and RDP. 



VII. Repairing parities in existing codes that address only systematic repair 



Several codes proposed in the literature ||7J, (8j, |19|, 1 30 1 can efficiently repair only the systematic nodes, and 
require the download of the entire message for repair of any parity node. In this section, we piggyback these codes 
to reduce the read and download during repair of parity nodes, while also retaining the efficiency of repair of 
systematic nodes. This piggybacking design is first illustrated with the help of an example. 



Example 6: Consider the code depicted in Fig. 6a originally proposed in / |79| /. This is an MDS code with 
parameters (n = 4, k = 2), and the message comprises four symbols a±, 02, b\ and 62 over finite field F5, The 
code can repair any systematic node with an optimal data-read and download. Node 1 is repaired by reading and 
downloading the symbols ai, (3ai + 2b\ + a<i) and (3ai + 4&i + 202) from nodes 2, 3 and 4 respectively; node 2 
is repaired by reading and downloading the symbols b±, (bi + 2<i2 + 362) an d (b\ + 2a2 + 62) from nodes 1, 3 and 
4 respectively. The amount of data-read and downloaded in these two cases are the minimum possible. However, 
under this code, the repair of parity nodes with reduced data-read has not been addressed. 



In this example, we piggyback the code of Fig. 6a to enable efficient repair of the second parity node. In 



particular, we take two instances of this code and piggyback it in a manner shown in Fig. 6b This code is obtained 
by piggybacking on the first parity symbol of the last two instance, as shown in Fig. In this piggybacked code, 
repair of systematic nodes follow the same algorithm as in the base code, i.e., repair of node 1 is accomplished by 
downloading the first and third symbols of the remaining three nodes, while the repair of node 2 is performed by 
downloading the second and fourth symbols of the remaining nodes. One can easily verify that the data obtained 



in each of these two cases is identical to what would have been obtained in the code of Fig. 6a in the absence of 
piggybacking. Thus the repair of the systematic nodes remains optimal. Now consider repair of the second parity 
node, i.e., node 4. The code (Fig. 6a\, as proposed in [19], would require reading 8 symbols (which is the size of the 
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(12,10) (14,12) (15,12) (12,9) (16,12) (20,15) (24,18) (12,10) (14,12) (15,12) (12,9) (16,12) (20,15) (24,18) 

Code Parameters: (n, k) Code Parameters: (n, k) 

(a) Systematic (b) Parity 




(12,10) (14,12) (15,12} (12,9) (16,12) (20,15) (24,18) 

Code Parameters: (n, k) 
(c) Overall 



Fig. 5: Average data-read and download for repair of systematic, parity, and all nodes in the three piggybacking designs, 
Rotated-RS codes (8), and (repair-optimized) EVENODD and RDP codes (25), (26). The (repair-optimized) EVENODD and 
RDP codes exist only for (n — k) = 2, and the data-read and download required for repair are identical to that of a rotated-RS 
codes with the same parameters. While the savings plotted correspond to a relatively small number of substripes, an increase 
in this number improves the performance of the piggybacked codes. 
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(b) Piggybacking to also optimize repair of parity nodes 



Fig. 6: An example illustrating piggybacking to perform efficient repair of the parities in an existing code that originally 
addressed the repair of only the systematic nodes. See Example [6] for more details. 
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entire message) for this repair. However, the piggybacked version of Fig. ^6b^can accomplish this task by reading and 
downloading only 6 symbols: c\, c<i, d\, c?2. (3ci+2di+C2+3ai+46i+2a2) and (di+2c2+3<i2+&i+2a2+&2)- Here, 
the first four symbols help in the recovery of the last two symbols of node 4, (3ci +4d\ +2C2) and (d\ +2c2 + (^2). 
Further, from the last two downloaded symbols, (3ci + 2d± + C2) and (di + 2c2 + 3^) can be subtracted out ( using 
the known values of c\, C2, d\ and d^) to obtain the remaining two symbols (3ai +4&i +2a2) and (b\ +202 + ^2)- 



Finally, one can easily verify that the MDS property of the code in Fig. 6a carries over to Fig. ^bj as discussed in 
Section [77] 

We now present a general description of this piggybacking design. We first set up some notation. Let us assume 
that the base code is a vector code, under which each node stores a vector of length fi (a scalar code is, of course, 
a special case with fi = 1). Let a = [&[ slJ • • • af\] T be the message, with systematic node i (e {1, . . . , k}) 
storing the ji symbols &J . Parity node (k + j), j G {l,...,r}, stores the vector a T Pj of p, symbols for some 



(k/j, x fx) matrix Pj. Fig. 7a illustrates this notation using two instances of such a (vector) code. 
We assume that in the base code, the repair of any failed node requires only linear operations at the other nodes. 



More concretely, for repair of a failed systematic node i, parity node (k + j) passes a T PjQ^ for some matrix 



Qf. 

The following lemma serves as a building block for this design. 

Lemma 1: Consider two instances of any base code, operating on messages a and b respectively. Suppose there 
exist two parity nodes [k + x) and (k + y), a (/i X p) matrix R, and another matrix S such that 

RQ^=QfS Vi€{l,...,fc}. (10) 

nv 'i I-,! .riri.k.i, 4 In ill,-, t-,.11-1 

yR) 

Proof: Consider repair of any systematic node i G {1, . . . , k}. In the piggybacked code, we let each node pass 
the same linear combinations of its data as it did under the base code. This keeps the amount of read and download 
identical to the base code. Thus, parity node (k + x) passes & T P X Q X ^ and (h T P x + a 7 P y R)Q x \ while parity 
node (k + y) passes a T P y Q$ and h T P y Qy i] . From {TO]) we see that the data obtained from parity node {k + y) 
gives access to & T P y Qy^ S = P y RQ x . This is now subtracted from the data downloaded from node (k + x) to 
obtain h T P x Qx\ At this point, the data obtained is identical to what would have been obtained under the repair 
algorithm of the base code, which allows the repair to be completed successfully. ■ 



Then, adding & T P y R as a piggyback to the parity symbol b T P x of node (k + x) (i.e., changing it from h T P x to 
(h T P x + & T P y R)) does not alter the amount of read or download required during repair of any systematic node. 



An example of such a piggybacking is depicted in Fig. 7b 



Under a piggybacking as described in the lemma, the repair of parity node (k + y) can be made more efficient 
by exploiting the fact that the parity node (k + x) now stores the piggybacked symbol ih T P x + SL T P y R). We now 
demonstrate the use of this design by making the repair of parity nodes efficient in the explicit MDS 'regenerating 
code' constructions of |7}, j9j, [19|, (30} which address the repair of only the systematic nodes. These codes have 



the property that 

Q« = Q l Vt€{l,. ..,*}, Vx€{l,...,r} 

i.e., the repair of any systematic node involves every parity node passing the same linear combination of its data 
(and this linear combination depends on the identity of the systematic node being repaired). It follows that in these 
codes, the condition ( fTO] ) is satisfied for every pair of parity nodes with R and S being identity matrices. 

Example 7: The piggybacking of (two instances) of any such code /[7|/, p9^ , 1^30^ is shown in Fig. 7c_ (for 



the case r = 3 ). As discussed previously, the MDS property and the property of efficient repair of systematic nodes 
is retained upon piggybacking. The repair of parity node (k + 1) in this example is carried out by downloading 
all the 2kpL symbols. On the other hand, repair of node (k + 2) is accomplished by reading and downloading b 
from the systematic nodes, (h T Pi + a r P2 + gFPz) from the first parity node, and (a T P^) from the third parity 
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(a) Two instances of the vector base 
code. 
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(b) Illustrating the piggybacking 
stated in Lemma [T] The parities 
(k + 1) and (k + 2) respectively 
correspond to (k + x) and (k + y) 
of the Lemma. 
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(c) Piggybacking the 'regenerating code' constructions 
of (7), (19), (30) for efficient parity repair 



Fig. 7: Piggybacking for efficient parity-repair in existing codes originally constructed for repair of only systematic nodes. 



node. This gives the two desired symbols a T P 2 and b T P 2 . Repair of the third parity is performed in an identical 
manner, except that a T P 2 is downloaded from the second parity node. The average amount of download and read 
for the repair of parity nodes, as a fraction of the size k\x of the message, is thus 

2k + 2 
3k 

which translates to a saving of around 33%. 

In general, the set of r parity nodes is partitioned into 

9 



sets of equal sizes (or nearly equal sizes if r is not a multiple of g). Within each set, the encoding procedure of 
Fig. 7c is performed separately. The first parity in each group is repaired by downloading all the data from the 
systematic nodes. On the other hand, as in Example [7] the repair of any other parity node is performed by reading 
b from the systematic nodes, the second (which is piggybacked) symbol of the first parity node of the set, and the 
first symbols of all other parity nodes in the set. Assuming the g sets have equal number of nodes (i.e., ignoring 
rounding effects), the average amount of read and download for the repair of parity nodes, as a fraction of the size 
kfi of the message, is 

1 k + ( r --l) 2 



+ 



2A: 



VIII. Conclusions And Open Problems 

We present a new piggybacking framework for designing storage codes that require low data-read and download 
during repair of failed nodes. This framework operates on multiple instances of existing codes and cleverly adds 
functions of the data from one instance onto the other, in a manner that preserves properties such as minimum 
distance and the finite field of operation, while enhancing the repair-efficiency. We illustrate the power of this 
framework by using it to design the most efficient codes (to date) for three important settings. In the paper, we also 
show how this framework can enhance the efficiency of existing codes that focus on the repair of only systematic 
nodes, by piggybacking them to also enable efficient repair of parity nodes. 

This simple-yet-powerful framework provides a rich design space for construction of storage codes. In this paper, 
we provide a few designs of piggybacking and specialize it to existing codes to obtain the four specific classes of 
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code constructions. We believe that this framework has a greater potential, and clever designs of other piggybacking 
functions and application to other base codes could potentially lead to efficient codes for various other settings as 
well. Further exploration of this rich design space is left as future work. Finally, while this paper presented only 
achievable schemes for data-read efficiency during repair, determining the optimal repair-efficiency under these 
settings remains open. 
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Appendix 



Proof of Theorem yf| Let us restrict our attention to only the nodes in set S, and let \S\ denote the size of this 
set. From the description of the piggybacking framework above, the data stored in instance j (1 < j < a) under 
the base code is a function of Uj. This data can be written as a 1 5 1 -length vector i(Uj) with the elements of this 
vector corresponding to the data stored in the | *S' | nodes in set S. On the other hand, the data stored in instance 
j of the piggybacked code is of the form (i(Uj) + gj(Ui, • • • , Uj-i)) for some arbitrary (vector-valued) functions 
'g'. Now, 



I ({Yil 



,U a ) = l({i{U j )+ Ej {U 1 ,...,U j - X )}^ =l ; U u ...,U a ) 



= ^/({f^O+g^f/!,...,^!)}^; U t 

i=\ 

a 

= J2 1 ( f (^)> m)+gi(tfi,---,^-i)} 

a 

> J2l(f(U e ) ; Ut | Ux,..., U^) 



a 

j=l+l > 



Ux, 



a 

l{{Xi} ieS ;Ux,...,U a ) 



(11) 
(12) 

,U t -i) (13) 
(14) 
(15) 



(16) 



where the last two equations follow from the fact that the messages Ue of different instances £ are independent. 



